1 Introduction



In this paper we establish a commutator estimate which allows one to concretely identify the
product BMO space, $BMO(\mathbb{R}^2_+ \times \mathbb{R}^2_+)$, of Chang and R. Fefferman, as an operator space on
$L^2(\mathbb{R}^2)$. The one parameter analogue of this result is a well-known theorem of Nehari [8].
The novelty of this paper is that we discuss a situation governed by a two parameter family
of dilations, and so the spaces $H^1$ and BMO have a more complicated structure.

Here $\mathbb{R}^2_+$ denotes the upper half-plane and $BMO(\mathbb{R}^2_+ \times \mathbb{R}^2_+)$ is defined to be the dual of the
real-variable Hardy space $H^1$ on the product domain $\mathbb{R}^2_+ \times \mathbb{R}^2_+$. There are several equivalent
ways to define this latter space and the reader is referred to [5] for the various characterizations.
We will be more interested in the biholomorphic analogue of $H^1$, which can be defined
in terms of the boundary values of biholomorphic functions on $\mathbb{R}^2_+ \times \mathbb{R}^2_+$ and will be denoted
throughout by $H^1(\mathbb{R}^2_+ \times \mathbb{R}^2_+)$, cf. [10].



In one variable, the space $L^2(\mathbb{R})$ decomposes as the direct sum $H^2(\mathbb{R}) \oplus \overline{H^2(\mathbb{R})}$ where $H^2(\mathbb{R})$
is defined as the boundary values of functions in $H^2(\mathbb{R}^2_+)$ and $\overline{H^2(\mathbb{R})}$ denotes the space of
complex conjugates of functions in $H^2(\mathbb{R})$. The space $L^2(\mathbb{R}^2)$, therefore, decomposes as the
direct sum of the four spaces $H^2(\mathbb{R}) \otimes H^2(\mathbb{R})$, $\overline{H^2(\mathbb{R})} \otimes H^2(\mathbb{R})$, $H^2(\mathbb{R}) \otimes \overline{H^2(\mathbb{R})}$ and
$\overline{H^2(\mathbb{R})} \otimes \overline{H^2(\mathbb{R})}$, where the tensor products are the Hilbert space tensor products. Let $P_{\pm,\pm}$ denote
the orthogonal projection of $L^2(\mathbb{R}^2)$ onto the holomorphic/anti-holomorphic subspaces, in
the first and second variables, respectively, and let $H_j$ denote the one-dimensional Hilbert
transform in the $j$th variable, $j = 1, 2$. In terms of the projections $P_{\pm,\pm}$,

$H_1 = P_{+,+} + P_{+,-} - P_{-,+} - P_{-,-}$ and $H_2 = P_{+,+} + P_{-,+} - P_{+,-} - P_{-,-}$.

The nested commutator determined by the function $b$ is the operator $[[M_b, H_1], H_2]$ acting
on $L^2(\mathbb{R}^2)$ where, for a function $b$ on the plane, we define $M_b f := bf$.

In terms of the projections $P_{\pm,\pm}$, it takes the form

(1.1) $\frac14 [[M_b, H_1], H_2] = P_{+,+}M_bP_{-,-} - P_{+,-}M_bP_{-,+} - P_{-,+}M_bP_{+,-} + P_{-,-}M_bP_{+,+}$.
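
The identity (1.1) can be checked by iterating the one-variable commutator computation. The following is a sketch under the assumption, consistent with the formulas for $H_1$ and $H_2$ above, that $H_j = P_j^+ - P_j^-$, where $P_j^\pm$ (our notation, not the paper's) denote the projections onto positive/negative frequencies in the $j$th variable, so that $P_{\pm,\pm} = P_1^\pm P_2^\pm$.

```latex
% One variable: write I = P^+ + P^- and H = P^+ - P^-.
\begin{align*}
[M_b, P^+] &= (P^+ + P^-) M_b P^+ - P^+ M_b (P^+ + P^-)
            = P^- M_b P^+ - P^+ M_b P^-, \\
[M_b, P^-] &= [M_b, I - P^+] = -[M_b, P^+], \\
[M_b, H]   &= 2\,\bigl(P^- M_b P^+ - P^+ M_b P^-\bigr).
\end{align*}
% Two variables: P_1^\pm commute with H_2, so the same formula applies
% to the inner commutator [M_b, H_2] sandwiched between P_1^\mp, P_1^\pm.
\begin{align*}
[[M_b, H_1], H_2]
  &= 2\,\bigl(P_1^- [M_b, H_2] P_1^+ - P_1^+ [M_b, H_2] P_1^-\bigr) \\
  &= 4\,\bigl(P_{-,-} M_b P_{+,+} - P_{-,+} M_b P_{+,-}
            - P_{+,-} M_b P_{-,+} + P_{+,+} M_b P_{-,-}\bigr).
\end{align*}
```

Dividing by 4 gives the four terms of (1.1), with the mixed terms carrying the minus signs.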

Ferguson and Sadosky [4] established the inequality $\|[[M_b, H_1], H_2]\|_{L^2 \to L^2} \le c\|b\|_{BMO}$. The
main result is the converse inequality.

1.2. Theorem. There is a constant $c > 0$ such that $\|b\|_{BMO} \le c\|[[M_b, H_1], H_2]\|_{L^2 \to L^2}$ for
all functions $b$ in $BMO(\mathbb{R}^2_+ \times \mathbb{R}^2_+)$.

As A. Chang and R. Fefferman have established for us, the structure of the space BMO
is more complicated in the two parameter setting, requiring a more subtle approach to this
theorem, despite the superficial similarity of the results to the one parameter setting. The
proof relies on three key ideas. The first is the dyadic characterization of the BMO norm
given in [1]. The second is a variant of Journé's lemma [6], whose proof is included in the
appendix. The third idea is that we have the estimates, the second of which was shown in
[4],

$\|b\|_{BMO(rec)} \le c\|[[M_b, H_1], H_2]\|_{L^2 \to L^2} \le c'\|b\|_{BMO}$.

An unpublished example of L. Carleson shows that the rectangular BMO norm is not comparable
to the BMO norm [3]. We may assume that the rectangular BMO norm of the
function $b$ is small. Indeed, this turns out to be an essential aspect of the argument.

From theorem 1.2 we deduce a weak factorization for the (biholomorphic) space $H^1(\mathbb{R}^2_+ \times \mathbb{R}^2_+)$.
The idea is that if the function $b$ has biholomorphic extension to $\mathbb{R}^2_+ \times \mathbb{R}^2_+$ then for functions
$f, g \in L^2(\mathbb{R}^2)$,

$\frac14 \langle [[M_b, H_1], H_2] f, g \rangle = \langle b, \overline{P_{-,-}f}\, P_{+,+}g \rangle$.

So in this case, the operator norm of the nested commutator $[[M_b, H_1], H_2]$ is comparable to
the dual norm

$\|b\|_* := \sup |\langle fg, b \rangle|$

where the supremum above is over all pairs $f, g$ in the unit ball of $H^2(\mathbb{R}^2_+ \times \mathbb{R}^2_+)$. On the
other hand, since $\|b\|_{BMO}$ and $\|[[M_b, H_1], H_2]\|_{L^2 \to L^2}$ are comparable, the dual norm above
satisfies

$\|b\|_* \approx \sup |\langle h, b \rangle|$

where the supremum is over all functions $h$ in the unit ball of $H^1(\mathbb{R}^2_+ \times \mathbb{R}^2_+)$.

1.3. Corollary. Let $h$ be in $H^1(\mathbb{R}^2_+ \times \mathbb{R}^2_+)$ with $\|h\|_1 = 1$. Then there exist functions
$(f_j), (g_j) \subset H^2(\mathbb{R}^2_+ \times \mathbb{R}^2_+)$ such that $h = \sum_{j=1}^{\infty} f_j g_j$ and $\sum_{j=1}^{\infty} \|f_j\|_2 \|g_j\|_2 \le c$.



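We remark that the weak factorization above implies the analogous factorization for $H^1$ of
the bidisk. Indeed, for all $1 \le p < \infty$, the map $u_p : H^p(\mathbb{R}^2_+ \times \mathbb{R}^2_+) \to H^p(\mathbb{D}^2)$ defined by

$(u_p f)(z, w) := \pi^{2/p} \left(\frac{2i}{1 - z}\right)^{2/p} \left(\frac{2i}{1 - w}\right)^{2/p} f(\alpha(z), \alpha(w))$, $\alpha(\lambda) := i\,\frac{1 + \lambda}{1 - \lambda}$,

is an isometry with isometric inverse

$(u_p^{-1} g)(z, w) = \pi^{-2/p} \left(\frac{1}{z + i}\right)^{2/p} \left(\frac{1}{w + i}\right)^{2/p} g(\beta(z), \beta(w))$, $\beta(\lambda) := \frac{\lambda - i}{\lambda + i}$.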



The dual formulation of weak factorization for $H^1(\mathbb{D}^2)$ is a Nehari theorem for the bidisk.
Specifically, if $b \in H^2(\mathbb{D}^2)$ then the little Hankel operator with symbol $b$ is densely defined
on $H^2(\mathbb{D}^2)$ by the formula

$\Gamma_b f = P_{-,-}(\bar b f)$.

By (1.1), $\|\Gamma_b\| = \frac14 \|[[M_{\bar b}, H_1], H_2]\|$ and thus, by theorem 1.2, $\|\Gamma_b\|$ is comparable to
$\|b\|_{BMO}$ which, by definition, is just the norm of $b$ acting on $H^1(\mathbb{D}^2)$. So the boundedness of
the Hankel operator $\Gamma_b$ implies that there is a function $\psi \in L^\infty(\mathbb{T}^2)$ such that $P_{+,+}\psi = b$.

Several variations and complements on these themes in the one parameter setting have been 
obtained by Coifman, Rochberg and Weiss [2]. 

The paper is organized as follows. Section 2 gives the one-dimensional preliminaries for the
proof of theorem 1.2 and Section 3 is devoted to the proof of theorem 1.2. The appendix
contains the variant of Journé's lemma we use in our proof in Section 3. One final remark
about notation. $A \lesssim B$ means that there is an absolute constant $C$ for which $A \le CB$.
$A \approx B$ means that $A \lesssim B$ and $B \lesssim A$.

We are indebted to the anonymous referee.



2 Remarks on the one dimensional case 

Several factors conspire to make the one dimensional case easier than the higher dimensional 
case. Before proceeding with the higher dimensional case, we make several comments about 
the one dimensional case, comments that extend and will be useful in the subsequent section. 

Let $H$ denote the Hilbert transform in one variable, let $P_+ = \frac12(I + H)$ be the projection of
$L^2(\mathbb{R})$ onto the positive frequencies, and let $P_- = \frac12(I - H)$ be the projection onto the negative
frequencies. We shall in particular rely upon the following basic computation.

(2.1) $\frac12 [M_b, H]\bar b = P_-|P_-b|^2 - P_+|P_+b|^2$.

The frequency distribution of $|P_\pm b|^2$ is symmetric since it is real valued. Thus,

$\|b\|_4^2 \lesssim \|P_-|P_-b|^2 - P_+|P_+b|^2\|_2 \lesssim \|[M_b, H]\bar b\|_2 \le \|[M_b, H]\|_{2 \to 2}\|b\|_2$.

Moreover, if $b$ is supported on an interval $I$, we see that

$\|b\|_2 \le |I|^{1/4}\|b\|_4 \lesssim |I|^{1/4}\|[M_b, H]\|_{2 \to 2}^{1/2}\|b\|_2^{1/2}$,

which is the BMO estimate on $I$. We seek an extension of this estimate in the two parameter
setting.
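
For the reader's convenience, here is a sketch of why (2.1) holds and how it yields the $L^4$ bound. It assumes the normalization $H = P_+ - P_-$ (equivalent to $P_\pm = \frac12(I \pm H)$); the individual steps are our reconstruction rather than a quotation from the argument above.

```latex
% Step 1: the commutator in terms of projections (using I = P_+ + P_-).
\[
\tfrac12 [M_b, H] = [M_b, P_+] = P_- M_b P_+ - P_+ M_b P_- .
\]
% Step 2: apply to \bar b, using P_\pm \bar b = \overline{P_\mp b}.
\begin{align*}
P_-\bigl(b\,\overline{P_- b}\bigr)
  &= P_-\bigl(|P_- b|^2\bigr) + P_-\bigl(P_+ b\,\overline{P_- b}\bigr)
   = P_- |P_- b|^2, \\
P_+\bigl(b\,\overline{P_+ b}\bigr)
  &= P_+\bigl(|P_+ b|^2\bigr) + P_+\bigl(P_- b\,\overline{P_+ b}\bigr)
   = P_+ |P_+ b|^2,
\end{align*}
% since P_+ b \overline{P_- b} has only positive frequencies and
% P_- b \overline{P_+ b} has only negative frequencies.
% Step 3: for real-valued g, |\hat g| is even, so \|P_\pm g\|_2^2 = \tfrac12 \|g\|_2^2.
% The ranges of P_- and P_+ are orthogonal, hence
\[
\bigl\| P_-|P_- b|^2 - P_+|P_+ b|^2 \bigr\|_2^2
  = \tfrac12 \|P_- b\|_4^4 + \tfrac12 \|P_+ b\|_4^4
  \gtrsim \|b\|_4^4 ,
\]
% the last step by the triangle inequality for b = P_+ b + P_- b.
% Step 4: if b is supported on I, divide the displayed chain by \|b\|_2^{1/2}:
\[
\|b\|_2 \lesssim |I|^{1/2}\, \|[M_b, H]\|_{2 \to 2} .
\]
```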

We use a wavelet proof of theorem 1.2, and specifically use a wavelet with compact frequency
support constructed by Y. Meyer [7]. There is a Schwartz function $w$ with these properties.

• $\|w\|_2 = 1$.

• $\hat w(\xi)$ is supported on $[2/3, 8/3]$ together with the symmetric interval about 0.

• $P_\pm w$ is a Schwartz function. More particularly, we have

$|w(x)|, |P_\pm w(x)| \lesssim (1 + |x|)^{-n}$, $n \ge 1$.

Let $\mathcal{D}$ denote the collection of dyadic intervals on $\mathbb{R}$. For any interval $I$, let $c(I)$ denote its
center, and define

$w_I(x) := \frac{1}{\sqrt{|I|}}\, w\!\left(\frac{x - c(I)}{|I|}\right)$.

Set $w_I^\pm := P_\pm w_I$. The central facts that we need about the functions $\{w_I : I \in \mathcal{D}\}$ are
these.

First, that these functions are an orthonormal basis on $L^2(\mathbb{R})$. Second, that we have the
Littlewood-Paley inequalities, valid on all $L^p$, though $p = 4$ will be of special significance
for us. These inequalities are

(2.2) $\|f\|_p \approx \left\| \left[ \sum_{I \in \mathcal{D}} \frac{|\langle f, w_I \rangle|^2}{|I|} 1_I \right]^{1/2} \right\|_p$, $1 < p < \infty$.

Third, that the functions $w_I$ have good localization properties in the spatial variables. That
is,

(2.3) $|w_I(x)|, |w_I^\pm(x)| \lesssim |I|^{-1/2} \chi_I(x)^n$, $n \ge 1$,

where $\chi_I(x) := (1 + \mathrm{dist}(x, I)/|I|)^{-1}$. We find the compact localization of the wavelets in
frequency to be very useful. The price we pay for this utility below is the careful accounting
of "Schwartz tails" we shall make in the main argument. Fourth, we have the identity below
for the commutator of one $w_I$ with a $\overline{w_J}$. Observe that since $P_+$ is one half of $I + H$, it
suffices to replace $H$ by $P_+$ in the definition of the commutator.

(2.4) $w_{I,J} := [w_I, P_+]\overline{w_J}$
$= w_I \overline{w_J^-} - P_+ w_I \overline{w_J}$
$= P_- w_I \overline{w_J^-} - P_+ w_I \overline{w_J^+}$
$= P_- w_I^- \overline{w_J^-} - P_+ w_I^+ \overline{w_J^+}$
$= 0$ if $|I| > 4|J|$; $= P_-|w_I^-|^2 - P_+|w_I^+|^2$ if $I = J$; $= w_I^- \overline{w_J^-} - w_I^+ \overline{w_J^+}$ if $|J| \ge 4|I|$.

From this we see a useful point concerning orthogonality. For intervals $I$, $I'$, $J$ and $J'$, assume
$|J| > 8|I|$ and likewise for $I'$ and $J'$. Then

(2.5) $\mathrm{supp}(\widehat{w_{I,J}}) \cap \mathrm{supp}(\widehat{w_{I',J'}}) = \emptyset$, $|I'| > 8|I|$.

Indeed, this follows from a direct calculation. The positive frequency support of $w_I^+ \overline{w_J^+}$ is
contained in the interval $[(3|I|)^{-1}, 8(3|I|)^{-1}]$. Under the conditions on $I$ and $I'$, the frequency
supports are disjoint.
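
The interval in the last paragraph comes from adding frequency supports. A sketch, assuming the normalization $w_I(x) = |I|^{-1/2} w((x - c(I))/|I|)$ (so that $\hat w_I$ lives where $|\xi| \in |I|^{-1}[2/3, 8/3]$) and $|J| \ge 8|I|$:

```latex
\begin{align*}
\operatorname{supp} \widehat{w_I^+}
  &\subset \Bigl[\tfrac{2}{3|I|}, \tfrac{8}{3|I|}\Bigr], \qquad
\operatorname{supp} \widehat{\overline{w_J^+}}
  \subset \Bigl[-\tfrac{8}{3|J|}, -\tfrac{2}{3|J|}\Bigr]
  \subset \Bigl[-\tfrac{1}{3|I|}, 0\Bigr), \\
\operatorname{supp} \widehat{w_I^+ \overline{w_J^+}}
  &\subset \Bigl[\tfrac{2}{3|I|} - \tfrac{1}{3|I|},\ \tfrac{8}{3|I|}\Bigr]
   = \Bigl[\tfrac{1}{3|I|}, \tfrac{8}{3|I|}\Bigr].
\end{align*}
% If also |I'| > 8|I|, the corresponding interval for (I', J') is
% [1/(3|I'|), 8/(3|I'|)], whose right endpoint is < 1/(3|I|),
% so the two intervals are disjoint. The negative-frequency parts
% w_I^- \overline{w_J^-} are handled by symmetry.
```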



3 Proof of the main theorem 



$BMO(\mathbb{R}^2_+ \times \mathbb{R}^2_+)$ will denote the BMO of two parameters (or product BMO) defined as the
dual of (real) $H^1(\mathbb{R}^2_+ \times \mathbb{R}^2_+)$. The following characterization of the space $BMO(\mathbb{R}^2_+ \times \mathbb{R}^2_+)$ is
due to Chang and R. Fefferman [1].

The relevant class of rectangles is $\mathcal{R} = \mathcal{D} \times \mathcal{D}$, all rectangles which are products of dyadic
intervals. These are indexed by $R \in \mathcal{R}$. For such a rectangle, write it as a product $R_1 \times R_2$
and then define

$v_R(x_1, x_2) = w_{R_1}(x_1) w_{R_2}(x_2)$.

A function $f \in BMO(\mathbb{R}^2_+ \times \mathbb{R}^2_+)$ iff

$\sup_U \left[ |U|^{-1} \sum_{R \subset U} |\langle f, v_R \rangle|^2 \right]^{1/2} < \infty$.

Here, the sum extends over those rectangles $R \in \mathcal{R}$ and the supremum is over all open sets $U$
in the plane of finite measure. Note that the supremum is taken over a much broader class
of sets than merely rectangles in the plane. We denote this supremum as $\|f\|_{BMO}$.



In this definition, if the supremum over $U$ is restricted to just rectangles, this defines the
"rectangular BMO" space, and we denote this restricted supremum as $\|f\|_{BMO(rec)}$.

Let us make a further comment on the BMO condition. Suppose that for $R \in \mathcal{R}$, we have
non-negative constants $a_R$ for which

$\sum_{R \subset U} a_R \le |U|$

for all open sets $U$ in the plane of finite measure. Then, we have the John-Nirenberg
inequality

$\left\| \sum_{R \subset U} \frac{a_R}{|R|} 1_R \right\|_p \lesssim |U|^{1/p}$, $1 < p < \infty$.

See [1]. This, with the Littlewood-Paley inequalities, will be used several times below, and
referred to as the John-Nirenberg inequalities.



3.1 The Principal Points in the Argument 

We begin the principal line of the argument. The function $b$ may be taken to be of Schwartz
class. By multiplying $b$ by a constant, we can assume that the BMO norm of $b$ is 1. Set
$B_{2 \to 2}$ to be the operator norm of $[[M_b, H_1], H_2]$. Our purpose is to provide a lower bound for
$B_{2 \to 2}$. Let $U$ be an open set of finite measure for which we have the equality

$\sum_{R \subset U} |\langle b, v_R \rangle|^2 = |U|$.

As $b$ is of Schwartz class, such a set exists. By invariance under dilations by a factor of two,
we can assume that $\frac12 \le |U| \le 1$. In several estimates below, the measure of $U$ enters in, a
fact which we need not keep track of.

An essential point is that we may assume that the rectangular BMO norm of $b$ is at most
$\varepsilon$. The reason for this is that we have the estimate $\|b\|_{BMO(rec)} \lesssim B_{2 \to 2}$. See [4]. Therefore,
for a small constant $\varepsilon$ to be chosen below, we can assume that $\|b\|_{BMO(rec)} \le \varepsilon$, for otherwise
we have a lower bound on $B_{2 \to 2}$.

Associated to the set $U$ is the set $V$, defined below, which is an expansion of the set $U$. In
defining this expansion, it is critical that the measure of $V$ be only slightly larger than the
measure of $U$, and so in particular we do not use the strong maximal function to define this
expansion. In the definition of $V$, the parameter $\delta > 0$ will be specified later and $M_j$ is the
one dimensional maximal function applied in the direction $j$. Define

$V_{ij} := \{M_i 1_{\{M_j 1_U > 1 - \delta\}} > 1 - \delta\}$,
$V := V_{12} \cup V_{21}$,
$\mu(R) := \sup\{\mu : \mu R \subset V\}$, $R \subset U$.

The quantity $\mu(R)$ measures how deeply a rectangle $R$ is inside $V$. This quantity enters into
the essential Journé's Lemma, see [6] or the variant we prove in the appendix.

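In the argument below, we will be projecting $b$ onto subspaces spanned by collections of
wavelets. These wavelets are in turn indexed by collections of rectangles. Thus, for a
collection $\mathcal{A} \subset \mathcal{R}$, let us denote

$b^{\mathcal{A}} := \sum_{R \in \mathcal{A}} \langle b, v_R \rangle v_R$.

The relevant collections of rectangles are defined as

$\mathcal{U} := \{R \in \mathcal{R} : R \subset U\}$,
$\mathcal{V} := \{R \in \mathcal{R} - \mathcal{U} : R \subset V\}$,
$\mathcal{W} := \mathcal{R} - \mathcal{U} - \mathcal{V}$.

For functions $f$ and $g$, we set $\{f, g\} := [[M_f, H_1], H_2]\bar g$.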

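We will demonstrate that for all $\delta, \varepsilon > 0$ there is a constant $K_\delta > 0$ so that

(i) $\|\{b^{\mathcal{V}}, b^{\mathcal{U}}\}\|_2 \lesssim \delta^{1/4}$
(ii) $\|\{b^{\mathcal{W}}, b^{\mathcal{U}}\}\|_2 \le K_\delta \varepsilon^{1/3}$

Furthermore, we will show that $1 \lesssim \|\{b^{\mathcal{U}}, b^{\mathcal{U}}\}\|_2$. Since $b = b^{\mathcal{U}} + b^{\mathcal{V}} + b^{\mathcal{W}}$, $\|b^{\mathcal{U}}\|_2 \lesssim 1$ and
$\delta, \varepsilon > 0$ are arbitrary, a lower bound on $B_{2 \to 2}$ will follow from an appropriate choice of $\delta$ and
$\varepsilon$. To be specific, one concludes the argument by estimating

$1 \lesssim \|\{b^{\mathcal{U}}, b^{\mathcal{U}}\}\|_2$
$\lesssim \|\{b^{\mathcal{U}} + b^{\mathcal{V}}, b^{\mathcal{U}}\}\|_2 + \delta^{1/4}$
$\lesssim \|\{b, b^{\mathcal{U}}\}\|_2 + \delta^{1/4} + K_\delta \varepsilon^{1/3}$
$\lesssim B_{2 \to 2} + \delta^{1/4} + K_\delta \varepsilon^{1/3}$.

Implied constants are absolute. Choosing $\delta$ first and then $\varepsilon$ appropriately small supplies a
lower bound on $B_{2 \to 2}$.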



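The estimate $1 \lesssim \|\{b^{\mathcal{U}}, b^{\mathcal{U}}\}\|_2$ relies on the John-Nirenberg inequality and the two parameter
version of (2.1), namely

$\frac14 [[M_b, H_1], H_2]\bar b = P_{+,+}|P_{+,+}b|^2 - P_{+,-}|P_{+,-}b|^2 - P_{-,+}|P_{-,+}b|^2 + P_{-,-}|P_{-,-}b|^2$.

This identity easily follows from the one-variable identities. Here $P_{\pm,\pm}$ denotes the projection
onto the positive/negative frequencies in the first and second variables. These projections
are orthogonal and moreover, since $|P_{\pm,\pm}b|^2$ is real valued we have that $\|P_{\pm,\pm}|P_{\pm,\pm}b|^2\|_2 \gtrsim
\||P_{\pm,\pm}b|^2\|_2$. Therefore, $\|b^{\mathcal{U}}\|_4^2 \lesssim \|\{b^{\mathcal{U}}, b^{\mathcal{U}}\}\|_2$. It follows by the John-Nirenberg inequality
that

$1 \lesssim \|b^{\mathcal{U}}\|_2 = \left\| \left[ \sum_{R \in \mathcal{U}} \frac{|\langle b, v_R \rangle|^2}{|R|} 1_R \right]^{1/2} \right\|_2 \lesssim \left\| \left[ \sum_{R \in \mathcal{U}} \frac{|\langle b, v_R \rangle|^2}{|R|} 1_R \right]^{1/2} \right\|_4 \lesssim \|b^{\mathcal{U}}\|_4 \lesssim \|\{b^{\mathcal{U}}, b^{\mathcal{U}}\}\|_2^{1/2}$.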

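The estimate (i) relies on the fact that the one dimensional maximal function maps $L^1$ into
weak $L^1$ with norm one. Thus, for all $0 < \delta < 1/2$,

$|\{M_1 1_{\{M_2 1_U > 1 - \delta\}} > 1 - \delta\}| \le (1 - \delta)^{-2}|U| \le (1 + 6\delta)|U|$.

Now, if $R \in \mathcal{V}$, then $R \subset V$ and since $b$ has BMO norm one, it follows that

$|U| + \|b^{\mathcal{V}}\|_2^2 = \sum_{R \in \mathcal{U} \cup \mathcal{V}} |\langle b, v_R \rangle|^2 \le (1 + 6\delta)|U|$.

Hence $\|b^{\mathcal{V}}\|_2 \lesssim \delta^{1/2}$. Yet the BMO norm of $b^{\mathcal{V}}$ can be no more than that of $b$, which is to
say 1. Interpolating norms we see that $\|b^{\mathcal{V}}\|_4 \lesssim \delta^{1/4}$, and so

$\|\{b^{\mathcal{V}}, b^{\mathcal{U}}\}\|_2 \lesssim \|b^{\mathcal{V}}\|_4 \|b^{\mathcal{U}}\|_4 \lesssim \delta^{1/4}$.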
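
The two inequalities in the measure estimate for the expanded set can be unpacked as follows. This is a sketch, assuming as in the text that $M_1$ and $M_2$ satisfy the weak-type bound $|\{M_j 1_E > \lambda\}| \le \lambda^{-1}|E|$ on each line, that $0 < \delta < 1/2$, and that $|U| \le 1$; the set name $E$ is ours.

```latex
% Iterate the one-dimensional weak-type bound, one variable at a time
% (Fubini), with E := \{ M_2 1_U > 1 - \delta \}:
\[
\bigl|\{ M_1 1_E > 1-\delta \}\bigr|
  \le (1-\delta)^{-1} |E|
  \le (1-\delta)^{-2} |U| .
\]
% Convexity: \phi(\delta) = (1-\delta)^{-2} is convex on [0, 1/2], with
% \phi(0) = 1 and \phi(1/2) = 4, so it lies below the chord 1 + 6\delta:
\[
(1-\delta)^{-2} \le 1 + 6\delta, \qquad 0 \le \delta \le \tfrac12 .
\]
% Consequently \|b^{\mathcal V}\|_2^2 \le 6\delta\,|U| \le 6\delta .
```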


3.2 Verifying the Estimate (ii)

We now turn to the estimate (ii). Roughly speaking $b^{\mathcal{U}}$ and $b^{\mathcal{W}}$ live on disjoint sets. But in
this argument we are trading off precise Fourier support of the wavelets for imprecise spatial
localization, that is the "Schwartz tails" problem. Accounting for this requires a careful
analysis, invoking several subcases.

A property of the commutator that we will rely upon is that it controls the geometry of $R$
and $R'$. Namely, $\{v_{R'}, v_R\} \ne 0$ iff writing $R = R_1 \times R_2$ and likewise for $R'$, we have for both
$j = 1, 2$, $|R'_j| \le 4|R_j|$. This follows immediately from our one-dimensional calculations,
in particular (2.4). We abbreviate this condition on $R$ and $R'$ as $R' \lesssim R$ and restrict our
attention to this case.

Orthogonality also enters into the argument. Observe the following. For rectangles R^^Rk, 
k^l,2, with R'' ^ and for j = 1 or j = 2 

(3.1) if 8|i?]| < |/?|| and 8\R^j \ < \R^j \ then {vj^^v^^v^^v^) = 0. 

This follows from applying (2.5) in the jth coordinate. 

Therefore, there are different partial orders on rectangles that are relevant to our argument.
They are

• $R' < R$ iff $8|R'_j| < |R_j|$ for $j = 1$ and $j = 2$.

• For $j = 1$ or $j = 2$, define $R' <_j R$ iff $R' \lesssim R$ and $8|R'_j| < |R_j|$ but $R' \not< R$.

• $R' \simeq R$ iff $\frac18 |R_j| \le |R'_j| \le 4|R_j|$ for $j = 1$ and $j = 2$.

These four partial orders divide the collection $\{(R', R) : R' \in \mathcal{W}, R \in \mathcal{U}, R' \lesssim R\}$ into
four subclasses which require different arguments.

In each of these four arguments, we have recourse to this definition. Set $\mathcal{U}_k$, for $k = 0, 1, 2, \ldots$
to be those rectangles in $\mathcal{U}$ with $2^{k} \le \mu(R) < 2^{k+1}$.

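Journé's Lemma enters into the considerations. Let $\mathcal{U}' \subset \mathcal{U}_k$ be a collection of rectangles
which are pairwise incomparable with respect to inclusion. For this latter collection, we have
the inequality

(3.2) $\sum_{R \in \mathcal{U}'} |R| \le K_\delta 2^{k/100} \Big| \bigcup_{R \in \mathcal{U}'} R \Big|$.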

See Journé [6], also see the appendix. This together with the assumption that $b$ has small
rectangular BMO norm gives us

(3.3) $\|b^{\mathcal{U}_k}\|_{BMO} \le K_\delta 2^{k/100} \varepsilon$.

This interplay between the small rectangular BMO norm and Journé's Lemma is a decisive
feature of the argument.

Essentially, the decomposition into the collections $\mathcal{U}_k$ is a spatial decomposition of the
collection $\mathcal{U}$. A corresponding decomposition of $\mathcal{W}$ enters in. Yet the definition of this class
differs slightly depending on the partial order we are considering.

For $R' \in \mathcal{W}$ and $R \in \mathcal{U}$ the term $\{v_{R'}, v_R\}$ is a linear combination of

$v_{R'} H_1 H_2 \overline{v_R}$, $H_1(v_{R'} H_2 \overline{v_R})$, $H_2(v_{R'} H_1 \overline{v_R})$, $H_1 H_2(v_{R'} \overline{v_R})$.

Consider the last term. As we are to estimate an $L^2$ norm, the leading operators $H_1 H_2$
can be ignored. Moreover, the essential properties of wavelets used below still hold for the
conjugates and Hilbert transforms of the same. These properties are Fourier localization and
spatial localization. Similar comments apply to the other three terms, and so the arguments
below apply to each type of term separately.
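
The four terms come from expanding the nested commutator. A short check, writing $A = M_f$ (our shorthand) and using only that $H_1$ and $H_2$ commute:

```latex
\begin{align*}
[[A, H_1], H_2]
  &= (A H_1 - H_1 A) H_2 - H_2 (A H_1 - H_1 A) \\
  &= A H_1 H_2 - H_1 A H_2 - H_2 A H_1 + H_1 H_2 A .
\end{align*}
% Applied to a function g, with A = M_f:
%   f H_1 H_2 g  -  H_1(f H_2 g)  -  H_2(f H_1 g)  +  H_1 H_2 (f g).
```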



3.2.1 The partial order '<' 

We consider the case of $R' < R$ for $R' \in \mathcal{W}$ and $R \in \mathcal{U}$. The sums we are considering are related
to the following definition. Set

$b^{\mathcal{U}_k}_{trun}(x) := \sup_{R'} \Big| \sum_{R \in \mathcal{U}_k,\ R' < R} \langle b, v_R \rangle v_R(x) \Big|$.

Note that we consider the maximal truncation of the sum over all choices of dimensions of
the rectangles in the sum. Thus, this sum is closely related to the strong maximal function
$M$ applied to $b^{\mathcal{U}_k}$, so that in particular we have the estimate below, which relies upon (3.3).

$\|b^{\mathcal{U}_k}_{trun}\|_p \lesssim \varepsilon 2^{k/100}$, $1 < p < \infty$.

(By a suitable definition of the strong maximal function $M$, one can deduce this inequality
from the $L^p$ bounds for $M$.) We apply this inequality far away from the set $U$. For the set
$W_\lambda = \mathbb{R}^2 - \bigcup_{R \in \mathcal{U}_k} \lambda R$, $\lambda > 1$, we have the inequality

(3.4) $\|b^{\mathcal{U}_k}_{trun}\|_{L^p(W_\lambda)} \lesssim \varepsilon 2^{k/100} \lambda^{-100}$, $1 < p < \infty$.



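We shall need a refined decomposition of the collection $\mathcal{W}$, the motivation for which is the
following calculation. Let $\mathcal{W}' \subset \mathcal{W}$. For $n = (n_1, n_2) \in \mathbb{Z}^2$, set $\mathcal{W}'(n) := \{R' \in \mathcal{W}' : |R'_j| =
2^{n_j},\ j = 1, 2\}$. In addition, let

$B(\mathcal{W}', n) := \sum_{R' \in \mathcal{W}'(n)} \sum_{R \in \mathcal{U}_k,\ R' < R} \langle b, v_{R'} \rangle \langle b, v_R \rangle v_{R'} v_R$.

And set $B(\mathcal{W}') = \sum_{n \in \mathbb{Z}^2} B(\mathcal{W}', n)$.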



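Then, in view of (3.1), we see that $B(\mathcal{W}', n)$ and $B(\mathcal{W}', n')$ are orthogonal if $n$ and $n'$ differ
by at least 3 in either coordinate. Thus,

$\|B(\mathcal{W}')\|_2^2 = \Big\| \sum_{n \in \mathbb{Z}^2} B(\mathcal{W}', n) \Big\|_2^2 \lesssim \sum_{n \in \mathbb{Z}^2} \|B(\mathcal{W}', n)\|_2^2$.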


The rectangles $R' \in \mathcal{W}'(n)$ are all translates of one another. Thus, taking advantage of the
rapid spatial decay of the wavelets, we can estimate



"^trun 



2 

dx 



In this display, we let x(2^i?^2) = (1 + + ^2) rectangles R, Xi?(xi,X2) = 

x(a;i I i?i I ~^ , X2 1 i?2 1 ~^) ■ Note that xr depends only on the dimensions of R and not its location. 

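Continuing, note the trivial inequality $\int (\chi_R * f)^2 g\, dx \lesssim \int |f|^2\, \chi_R * g\, dx$. We can estimate

$\|B(\mathcal{W}')\|_2^2 \lesssim \sum_{R' \in \mathcal{W}'} |\langle b, v_{R'} \rangle|^2 \Big\{ |R'|^{-1} \int_{R'} M(|b^{\mathcal{U}_k}_{trun}|^2)\, dx \Big\}$

(3.5) $\lesssim \Big| \bigcup_{R' \in \mathcal{W}'} R' \Big| \sup_{R' \in \mathcal{W}'} \mathrm{avg}(R')$.

Here we take $\mathrm{avg}(R') := |R'|^{-1} \int_{R'} M(|b^{\mathcal{U}_k}_{trun}|^2)\, dx$.

The terms $\mathrm{avg}(R')$ are essentially of the order of magnitude $\varepsilon$ times the scaled distance
between $R'$ and the open set $U$. To make this precise requires a decomposition of the
collection $\mathcal{W}$.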

For integers $l \ge k$ and $m \ge 0$, set $\mathcal{W}(l, m)$ to be those $R' \in \mathcal{W}$ which satisfy these three
conditions.

• First, avg(i?') < 62"^' if m = and £2-^'+"*-^ < avg(i?') < 62"^'+"^ if m > 0. 

• Second, there is an $R \in \mathcal{U}_k$ with $R' < R$ and $R' \subset 2^{l+1}R$.

• Third, for every $R \in \mathcal{U}_k$ with $R' < R$, we have $R' \not\subset 2^{l}R$. Certainly, this collection
of rectangles is empty if $l < k$.

We see that

$\Big| \bigcup_{R' \in \mathcal{W}(l, m)} R' \Big| \lesssim \min(2^{2lp}, 2^{-mp/2})$, $1 < p < \infty$.

The first estimate follows since the rectangles $R' \in \mathcal{W}(l, m)$ are contained in the set $\{M 1_U >
2^{-2l-1}\}$. The second estimate follows from (3.4).

But then from (3.5) we see that for m > 0, 

\\B{W{l,m))\\l ^ £2-^'+"^ min(22'P, 2-'"f/2) ^ e2-("+')/^°. 

In the case that m = 0, we have the bound 2^'**. This is obtained by taking the minimum 
to be 2^^^ for p = 5/4 and < m < For m > take the minimum to be 2"'"^'/^ with 
p = 4. 

This last estimate is summable over $0 \le k \le l$ and $0 \le m$ to at most $\lesssim \varepsilon$, and so completes
this case.



3.2.2 The Partial Orders '$<_j$', $j = 1, 2$.

We treat the case of $R' <_1 R$, while the case of $R' <_2 R$ is the same by symmetry. The structure
of this partial order provides some orthogonality in the first variable, leaving none in the
second variable. Bounds for the expressions from the second variable are derived from a
cognate of a Carleson measure estimate.

There is a basic calculation that we perform for a subset $\mathcal{W}' \subset \mathcal{W}$. For an integer $n' \in \mathbb{Z}$
define $\mathcal{W}'(n') := \{R' \in \mathcal{W}' : |R'_1| = 2^{n'}\}$, and

$B(\mathcal{W}', n') := \sum_{R' \in \mathcal{W}'(n')} \sum_{R \in \mathcal{U}_k,\ R' <_1 R} \langle b, v_{R'} \rangle \langle b, v_R \rangle v_{R'} v_R$.

Recalling (3.1), if $n'$ and $n''$ differ by more than 3, then $B(\mathcal{W}', n')$ and $B(\mathcal{W}', n'')$ are orthogonal.

Observe that for R' and R as in the sum defining i?(W", n), we have the estimate 
(3.6) \vr\x)v^{x)\ < (|i?||i?'|)-i/Mist(i?',i?)i°°V' * lii'(^), ^ e 

In this display, we are using the same notation as before, x{^ii^2) = {l + x^ +^2)"^'' and for 
rectangles $R$, $\chi_R(x_1, x_2) = \chi(x_1 |R_1|^{-1}, x_2 |R_2|^{-1})$. In addition, $\mathrm{dist}(R', R) := M 1_R(c(R'))$,
with $c(R')$ being the center of $R'$. [This "distance" is more properly the inverse of a distance
that takes into account the scale of the rectangle $R$.]



Now define

(3.7) $\beta(R') := \sum_{R \in \mathcal{U},\ R' <_1 R} |R|^{-1/2} |\langle b, v_R \rangle|\, \mathrm{dist}(R', R)^{1000}$.

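The main point of these observations and definitions is this. For the function $B(\mathcal{W}') :=
\sum_{n' \in \mathbb{Z}} B(\mathcal{W}', n')$, we have

$\|B\|_2^2 \lesssim \sum_{n' \in \mathbb{Z}} \|B(\mathcal{W}', n')\|_2^2$
$\lesssim \sum_{n' \in \mathbb{Z}} \int \Big[ \sum_{R' \in \mathcal{W}'(n')} |\langle b, v_{R'} \rangle| \beta(R') |R'|^{-1/2} \chi_{R'} * 1_{R'} \Big]^2 dx$
$\lesssim \sum_{n' \in \mathbb{Z}} \int \Big[ \sum_{R' \in \mathcal{W}'(n')} |\langle b, v_{R'} \rangle| \beta(R') |R'|^{-1/2} 1_{R'} \Big]^2 dx$.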

At this point, it occurs to one to appeal to the Carleson measure property associated to
the coefficients $|\langle b, v_{R'} \rangle| |R'|^{-1/2}$. This necessitates that one proves that the coefficients $\beta(R')$
satisfy a similar condition, which doesn't seem to be true in general. A slightly weaker
condition is however true.



To get around this difficulty, we make a further diagonalization of the terms $\beta(R')$ above.
For integers $\nu \ge \nu_0$, $\mu \ge 1$ and a rectangle $R' \in \mathcal{W}'$, consider rectangles $R$ such that

$R' <_1 R$, $2^{-\nu} \le \mathrm{dist}(R', R) < 2^{-\nu+1}$, $2^{\mu}|R'| = |R|$.

[The quantity $\nu_0$ depends upon the particular subcollection $\mathcal{W}'$ we are considering.] We
denote one of these rectangles as $\pi(R')$.

An important geometrical fact is this. We have $\pi(R') \subset 2^{\mu+\nu+10}R'_1 \times 2^{\nu+10}R'_2$. In
particular, this last rectangle has measure $\lesssim 2^{2\nu+\mu}|R'|$.

Therefore, there are at most $O(2^{2\nu})$ possible choices for $\pi(R')$. [Small integral powers of $2^{\nu}$
are completely harmless because of the large power of $\mathrm{dist}(R', R)$ that appears in (3.7).]

Our purpose is to bound this next expression by a term which includes a power of $\varepsilon$, a small
power of $2^{\nu}$ and a power of $2^{-\mu}$. Define



n'ez 



/, .^.x, -XR' * ^R' 



^ \^ f \{b,VH'){b. v^(R'))\ ^ 

^ \{b,VR'){b, Vn(R'))\ 

^ ^ /I D/l \^( D/M 2-^ 



dx 



dx 



\R" 



n'ez R'eW'{n') 



R"CR' 



— 1(6, VR»){b, v^(^R.))\ 



The innermost sum can be bounded this way. First $\|b\|_{BMO(rec)} \le \varepsilon$ so that

$\sum_{R'' \subset R'} |\langle b, v_{R''} \rangle|^2 \le \varepsilon^2 |R'|$.



Second, by our geometrical observation about 7i{R'), 

In particular, the factor 2" does not enter into this estimate.
This means that 



\R' 



R'GW ReUk 
1 1/2 



1/2 



R'ew 



The point of these computations is that a further trivial application of the Cauchy-Schwarz
inequality proves that



|5(W')||2 ^e2-^°°'^°| U R' 



R'eW 



1/4 



where pq is the largest integer such that for all R' E W and e W^, we have dist(i?', R) < 

We shall complete this section by decomposing $\mathcal{W}$ into subcollections for which this last
estimate is summable to $\varepsilon 2^{-k}$. Indeed, take $\mathcal{W}_\nu$ to be those $R' \in \mathcal{W}$ with $R' \not\subset 2^{\nu}R$ for all
$R \in \mathcal{U}_k$ with $R' <_1 R$, and for which there is an $R \in \mathcal{U}_k$ with $R' \subset 2^{\nu+1}R$ and $R' <_1 R$. Certainly, we
need only consider $\nu \ge k$.

It is clear that this decomposition of $\mathcal{W}$ will conclude the treatment of this partial order.



3.2.3 The partial order '$\simeq$'



We now consider the case of $R' \simeq R$, which is less subtle as there is no orthogonality to exploit
and the Carleson measure estimates are more directly applicable. We prove the bound



^ {b,VR>){b,VR)VR>VR 



R'ew Reu 

R'^R 

The diagonalization in space takes two different forms. For $\lambda \ge 2^k$ and $R \in \mathcal{U}_k$ set $\sigma(\lambda, R)$
to be a choice of $R' \in \mathcal{W}$ with $R' \simeq R$ and $R' \subset 2\lambda R$. (The definition is vacuous for $\lambda < 2^k$.)
It is clear that we need only consider $\approx \lambda^2$ choices of these functions $\sigma(\lambda, \cdot) : \mathcal{U}_k \to \mathcal{W}$.
There is an estimate which allows one to take advantage of the spatial separation between
$R$ and $\sigma(\lambda, R)$.



{b, Va(\,R)) {b, VR)Va{X,R)VR ^ A | (6, Va(x,R)) {b, Vr) \ 

ReUk 1 ReUk 

^X-'''[J2\{b,v^^,,R))\'J2\{b,VR) 
ReUk ReUk 



1/2 



This estimate uses (3.3) and is a very small estimate. 

To complete this case we need to provide an estimate in L'^. Here, we can be quite inefficient. 
By Cauchy-Schwarz and the Littlewood-Paley inequalities,



Reu 



'^{b,Va{X,R)){b,VR)v^^X,R)VR ^ \^\{b, Va(X,R))Va(X,R)\'^ \^\{b:VR)vR\ 



Reu 



Reu 



1/2 



^ A. 



This follows directly from the BMO assumption on $b$. Our proof is complete.



A A Remark on Journé's Lemma



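Let $U$ be an open set of finite measure in the plane. Let $\mathcal{R}^*(U)$ be those maximal dyadic
rectangles in $\mathcal{R}$ that are contained in $U$. Define for each $0 < \delta < 1$ and $i, j \in \{1, 2\}$,

$V_{\delta,i,j} := \{M_i 1_{\{M_j 1_U > \delta\}} > \delta\}$

and $V_\delta := V_{\delta,1,2} \cup V_{\delta,2,1}$. For each $R \in \mathcal{R}^*(U)$ set

$\mu_\delta(R) = \sup\{\mu > 0 : \mu R \subset V_\delta\}$.

The form of Journé's Lemma we need is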

A.1. Lemma. For each $0 < \delta, \varepsilon < 1$ and each open set $U$ in the plane of finite measure,

$\sum_{R \in \mathcal{R}^*(U)} \mu_\delta(R)^{-\varepsilon} |R| \lesssim |U|$.

Journé's Lemma is the central tool in verifying the Carleson measure condition, and points
to the central problem in two dimensions: that there can be many rectangles close to the
boundary of an open set.

Among the references we could find in the literature [6, 9], the form of Journé's Lemma cited
and proved is relative to a strictly larger quantity than the one we use, $\mu_\delta(R)$ above. To
define it, for any rectangle $R$, denote it as a product of intervals $R_1 \times R_2$. Set $M$ to be the
strong maximal function. Then, for an open set $U$ of finite measure and $R \in \mathcal{R}^*(U)$, set (taking
$\delta = 1/2$ for simplicity)

$\nu(R) = \sup\{\nu > 0 : \nu R_1 \times R_2 \subset \{M 1_U > 1/2\}\}$.

Thus, one only measures how deeply $R$ is in the enlarged set in one direction. The Lemma
above then holds for $\nu(R)$, with however a slightly sharper form of the sum than we prove
here.

In addition, note that one measures the depth of $R$ with respect to a simpler set, $\{M 1_U >
1/2\}$. We did not use this simplification in our proof as the strong maximal function $M$ does
not act boundedly on $L^1$ of the plane.

There are however examples which show that the quantity $\nu(R)$ can be much larger
than $\mu(R)$. Indeed, consider a horizontal row of evenly spaced squares. For a square $R$ in
the middle of this row, $\nu(R)$ will be quite big, while $\mu(R)$ will be about 1 for all $R$. Thus
we give a proof of our form of Journé's Lemma.

Proof of lemma A.1. We can assume that $1/2 < \delta < 1$ as the terms $\mu_\delta(R)$ decrease as $\delta$
increases. Fix $\mu \ge 1$. Set $S$ to be those rectangles in $\mathcal{R}^*(U)$ with $\mu \le \mu_\delta(R) < 2\mu$. It suffices
to show that

$\sum_{R \in S} |R| \lesssim (1 + \log \mu)^2 |U|$.

For then this estimate is summed over $\mu \in \{2^k : k \in \mathbb{Z}\}$.
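
To spell out this final summation: a sketch, assuming (as follows from $R \subset U \subset V_\delta$ for open $U$) that $\mu_\delta(R) \ge 1$, so only $\mu = 2^k$ with $k \ge 0$ contribute, and writing $S_k$ (our notation) for the class $S$ with $\mu = 2^k$. Logarithms are taken base 2, which only affects constants.

```latex
\begin{align*}
\sum_{R \in \mathcal{R}^*(U)} \mu_\delta(R)^{-\varepsilon} |R|
  &\le \sum_{k \ge 0} 2^{-\varepsilon k} \sum_{R \in S_k} |R|
     && (\mu_\delta(R) \ge 2^k \text{ on } S_k) \\
  &\lesssim \sum_{k \ge 0} 2^{-\varepsilon k} (1 + k)^2\, |U| \\
  &\lesssim_{\varepsilon} |U| ,
\end{align*}
% since \sum_{k \ge 0} (1+k)^2 2^{-\varepsilon k} < \infty for every \varepsilon > 0.
```

The implied constant in the last step depends on $\varepsilon$, which is why a polynomial loss in $\log \mu$ is affordable in the restricted estimate.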

In showing this estimate, we can further assume that for all $R, R' \in S$, writing $R = R_1 \times R_2$
and likewise for $R'$, that if for $j = 1, 2$, $|R_j| > |R'_j|$ then $|R_j| > 16\mu(1 - \gamma)^{-1}|R'_j|$, where we set
$\gamma = \delta^{1/2}$. This is done by restricting $\log_2 |R_j|$ to be in an arithmetic progression of difference
$\approx \log \mu$. This necessitates the division of all rectangles into $\lesssim (1 + \log \mu)^2$ subclasses and
so we prove the bound above without the logarithmic term.

We define a "bad" class of rectangles $B = B(S)$ as follows. For $j = 1, 2$, let $B_j(S)$ be those
rectangles $R$ for which there are rectangles

$R^1, R^2, \ldots, R^K \in S - \{R\}$

so that for each $1 \le k \le K$, $|R^k_j| > |R_j|$, and

$\Big| R \cap \bigcup_{k=1}^{K} R^k \Big| > \gamma |R|$.

Thus $R \in B_j$ if it is nearly completely covered in the $j$th direction of the plane. Set
$B(S) = B_1(S) \cup B_2(S)$. It follows that if $R \notin B(S)$, it is not covered in both the vertical and
horizontal directions, hence

$\Big| R \cap \bigcap_{R' \in S - \{R\}} (R')^c \Big| \ge (1 - \gamma)^2 |R|$.

And, since all $R \subset U$, it follows that

$\sum_{R \in S - B} |R| \le (1 - \gamma)^{-2} |U|$.



Thus, it remains to consider separately the sets of rectangles $B_1(S)$ and $B_2(S)$. Observe
that for any collection $S'$, $B_j(S') \subset S'$ as follows immediately from the definition. Hence
$B_1(B_2(B_1(S))) \subset B_1(B_1(S))$. And we argue that this last set is empty. As our definition of
$V_\delta$ and $\mu_\delta$ is symmetric with respect to the coordinate axes, this is enough to finish the
proof.



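We argue that $B_1(B_1(S))$ is empty by contradiction. Assume that $R \in B_1(B_1(S))$. Consider the
collection $\mathcal{C}$ of those rectangles $R'$ in $B_1(S)$ for which (i) $|R'_1| > |R_1|$ and (ii) $R' \cap R \ne \emptyset$. Then

$\Big| R \cap \bigcup_{R' \in \mathcal{C}} R' \Big| > \gamma |R|$.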

Fix one of these rectangles $R'$ with $|R'_1|$ being minimal. We then claim that $8\mu R' \subset
\{M_1 1_{\{M_2 1_U > \delta\}} > \delta\}$, which contradicts the assumption that $\mu_\delta(R')$ is no more than $2\mu$.

Indeed, all the rectangles in $B_1(S)$ are themselves covered in the first coordinate axis. We
see that the set $\{M_2 1_U > \gamma^2\}$ contains the rectangle $R''_1 \times \gamma^{-1}R_2$, in which $R_2$ is the
second coordinate interval for the rectangle $R$ and $R''_1$ is the dyadic interval that contains
$R'_1$ and has measure $8\mu(1 - \gamma)^{-1}|R'_1| \le |R''_1| < 16\mu(1 - \gamma)^{-1}|R'_1|$.

But then the rectangle $\gamma^{-1}R''_1 \times \gamma^{-1}R_2$ is contained in $\{M_1 1_{\{M_2 1_U > \gamma^2\}} > \gamma^2\}$. Since $8\mu R'$
is contained in this last rectangle, we have contradicted the assumption that $\mu_\delta(R') < 2\mu$.

□ 